Risk-Sensitive Markov Control Processes

نویسندگان

  • Yun Shen
  • Wilhelm Stannat
  • Klaus Obermayer
چکیده

We introduce a unified framework to incorporate risk in Markov decision processes (MDPs), via prospect maps, which generalize the idea of coherent/convex risk measures in mathematical finance. Most of the existing risk-sensitive approaches in various literature concerning with decision-making problems are contained in the framework as special instances. Within the framework, we solve the optimal control problems according to two criteria, the newly invented temporal discounted criterion, which generalizes the conventional discount scheme, and the average criterion, by value iteration algorithms under different assumptions. Two online algorithms are proposed to solve the optimal controls problem when the exact MDP is unknown and has to be estimated during optimization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of a risk-sensitive control problem for hidden Markov chains

In this paper the risk-sensitive control of parially observed Markov decision processes is considered. The replacement problem is analyzed in this context, and the structure of risk sensitive optimal controllers is given.

متن کامل

Existence of Risk Sensitive Optimal Stationary Policies for Controlled Markov Processes

In this paper we are concerned with the existence of optimal stationary policies for in nite horizon risk sensitive Markov control processes with denumerable state space, unbounded cost function, and long run average cost. Introducing a discounted cost dynamic game, we prove that its value function satis es an Isaacs equation, and its relationship with the risk sensitive control problem is stud...

متن کامل

Risk Sensitive Stochastic Control and Differential Games

We give a concise introduction to risk sensitive control of Markov diffusion processes and related two-controller, zero-sum differential games. The method of dynamic programming for the risk sensitive control problem leads to a nonlinear partial differential equation of HamiltonJacobi-Bellman type. In the totally risk sensitive limit, this becomes the Isaacs equation for the differential game. ...

متن کامل

Risk Sensitive Control of Markov Processes in

In this paper we consider in nite horizon risk-sensitive control of Markov processes with discrete time and denumerable state space. This problem is solved proving, under suitable conditions, that there exists a bounded solution to the dynamic programming equation. The dynamic programming equation is transformed into an Isaacs equation for a stochastic game; and the vanishin discount method is ...

متن کامل

Existence of Risk Sensitive Optimal

In this paper we are concerned with the existence of optimal stationary policies for innnite horizon risk sensitive Markov control processes with denu-merable state space, unbounded cost function, and long run average cost. Introducing a discounted cost dynamic game, we prove that its value function satisses an Isaacs equation, and its relationship with the risk sensitive control problem is stu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • SIAM J. Control and Optimization

دوره 51  شماره 

صفحات  -

تاریخ انتشار 2013